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ABSTRACT 

Radial velocity surveys are beginning to reach the time baselines required to detect Jupiter 
analogs, as well as sub-Saturn mass planets in close orbits. Therefore it is important to un- 
derstand the sensitivity of these surveys at long periods and low amplitudes. In this paper, I 
derive analytic expressions for the detectability of planets at both short and long periods, for 
circular and eccentric orbits. In the long period regime, the scaling of the detection threshold 
with period depends on the desired detection efficiency. The 99% velocity threshold scales as 
K (X (X a^, whereas the 50% velocity threshold scales as K oc P oc a?/"^. I suggest an 
extension of the Lomb-Scargle statistic to Keplerian orbits, and describe how to estimate the 
false alarm probability associated with a Keplerian fit. I use this Keplerian periodogram to 
investigate the effect of eccentricity on detectability. At short periods, detectability is reduced 
for eccentric orbits, mainly due to the sparse sampling of the periastron passage, whereas 
long period orbits are easier to detect on average if they are eccentric because of the steep 
velocity gradients near periastron. Fitting Keplerian orbits allows the lost sensitivity at short 
orbital periods to be recovered for e < 0.6, however there remain significant selection ef- 
fects against eccentric orbits for e > 0.6, and the small number of highly eccentric planets 
discovered so far may reflect this. Finally, I present a Bayesian approach to the periodogram 
which gives a simple derivation of the probability distributions of noise powers, clarifies why 
the periodogram is an appropriate way to search for long period signals, and emphasises the 
equivalence of periodogram and least squares techniques. 
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1 INTRODUCTION 

Precise radial velocity surveys have made detection of Jupiter mass 
companions to nearby stars routine, with more than 100 such "ex- 
oplanets" now known (see Marcy et al. 2003 for a review). As 
these surveys continue, the accessible parameter space grows to- 
wards lower masses and longer orbital periods. For example, re- 
cent observations have led to the discovery of Saturn mass planets 
in close orbits (Fischer et al. 2003), and a population of Jupiters 
with nearly-circular orbits at distances > 1 AU (Vogt et al. 2002; 
Carter et al. 2003; Jones et al. 2003). In addition, it is now possible 
to study the statistical occurrence rate and distributions of mass, pe- 
riod, and eccentricity of exoplanets (Vogt et al. 2000; Tabachnik & 
Tremaine 2002; Butler et al. 2003; Fischer et al. 2003; Lineweaver 
& Grether 2003; Jones et al. 2003; Udry, Mayor, & Santos 2003), 
and how these depend on the metallicity of the host stars (Fischer 
& Valenti 2003). These distributions contain important information 
about the planet formation process (Armitage et al. 2002; Ida & Lin 
2004). 

This recent work emphasises the need to understand the sen- 
sitivity of radial velocity surveys at long periods and low am- 
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plitudes. The sensitivity of radial velocity surveys has been dis- 
cussed previously by several authors, mostly for circular orbits. The 
Lomb-Scargle periodogram (Lomb 1976; Scargle 1982) is a com- 
monly used technique for searching for periodic sinusoidal signals 
in unevenly-sampled data, and allows analytic estimates of the de- 
tection threshold to be written down for periods less than the du- 
ration of the observations (Horne & Baliunas 1986). This was ap- 
plied to planet searches by Cochran & Hatzes (1996) and Nelson & 
Angel (1998). Endl et al. (2002) briefly investigated the detectabil- 
ity of eccentric orbits with the LS periodogram, finding that de- 
tectability was significantly reduced for e > 0.5. The detectability 
of long period planets, with orbital periods longer than the duration 
of the observations, was studied by Nelson & Angel (1998) and 
Eisner & Kulkami (2001). Rather than use the Lomb-Scargle peri- 
odogram to measure the significance of the fit, Nelson & Angel 
(1998) adopted the square of the best fitting amplitude . Eisner 
& Kulkami (2001) pointed out that this leads to reduced sensitiv- 
ity at long periods, because when fitting sinusoids to noise only, 
one finds that the amplitude is strongly correlated with the fitted 
phase. They adopted an "amplitude-phase" analysis to account for 
this correlation, and showed that the sensitivity was significantly 
improved. 

In this paper, I revisit the question of the sensitivity of radial 
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velocity surveys. The main motivations are first to address liow to 
assess the false alarm probability associated with a Keplerian orbit 
fit to radial velocity measurements, and secondly to derive simple 
analytic formulae for detection thresholds, including long orbital 
periods and non-zero orbital eccentricities. I show that significant 
selection effects operate against highly eccentric orbits, and that at 
long periods, a seperate analysis of amplitude and phase is not re- 
quired: the definition of the Lomb-Scargle periodogram in terms of 
Ax^ automatically accounts for the correlations between fitted pa- 
rameters at long periods. In addition, I discuss a Bayesian approach 
to this problem which will provide a useful basis for future study of 
the statistical distributions of extrasolar planet properties, as well as 
emphasising the fundamental equivalence of periodogram and least 
squares techniques. This equivalence, and the idea of extending the 
periodogram to non-sinusoidal signals have also been discussed in 
the literature on Bayesian statistics (see Bretthorst 1988, 2001a,b,c; 
Scargle 2002; Loredo & Chernoff 2003). 

An outline of the paper is as follows. In §2, 1 review the Lomb- 
Scargle periodogram, and suggest an extension to Keplerian orbits. 
In §3, I outline a Bayesian approach to the periodogram. In §4, I 
calculate the detection thresholds for short and long periods, and 
including non-zero eccentricity. The conclusions are presented in 
§5. 



2 LEAST SQUARES FITTING AND THE 
PERIODOGRAM 

2.1 Sources of Radial Velocity Variability 

We are interested in detecting the radial velocity wobble due to an 
orbiting planet given a set of measured radial velocities, observa- 
tion times, and measurement errors. Often the first indication of the 
presence of a planet is excess scatter in the radial velocities over 
the expected amount. A simple way to check for this is to calculate 
the probability that Xmcan = X/ (^i ~ /'^'^ drawn from a 
chi-squared distribution (the r(i//2, 1/2) distribution; Hoel, Port, 
& Stone 1971; Press et al. 1992). Here, Oi is the expected variabil- 
ity for data point i, and {v) is the mean of the data. A reduced 
much greater than 1 (the exact threshold depending on the desired 
false alarm probability) indicates excess variability. 

However, there is some uncertainty in predicting the expected 
variability in the radial velocities. Scatter in the radial velocities is 
expected from statistical and systematic measurement errors, and 
from intrinsic stellar radial velocity variations, or "jitter". The typi- 
cal measurement error depends on the survey, but is typically 3- 
5 m/s, and promises to improve towards ~ 1 m/s in the near 
future (Mayor et al. 2003; Butler et al. 2004). Jitter is thought to 
arise from a combination of surface convective motions, magnetic 
activity, and rotation (Saar & Donahue 1997). The amount of jit- 
ter depends on stellar properties such as rotation rate and spec- 
tral type, but is typically 3-5 m/s for chromospherically quiet 
stars (Saar, Butler, & Marcy 1998; Santos et al. 2000). Saar et 
al. (1998) used data from the Lick survey to find a rough relation 
(Jv ~ 5 m/s (23 d/Prot)^'^ for G and K type stars (Cumming, 
Marcy, & Butler 1999, hereafter CMB99). 

In this paper, we will generally ignore these uncertainties by 
assuming that the noise level is unknown, and looking for the best 
partitioning of the data into Gaussian noise plus a single planet on a 
circular or Keplerian orbit. However, we will also discuss the case 
where the noise o can be predicted in advance. As we will show. 



this issue is only important for small A*', when the noise level is 
difficult to accurately determine from the data. 



2.2 Circular Orbits: tlie Lomb-Scargle Periodogram 

We first discuss the Lomb-Scargle (LS) periodogram (Lomb 1976; 
Scargle 1982). Since it involves fitting sinusoids to the data, this is 
particularly appropriate for circular orbits. Given a set of observa- 
tion times {tj }, velocities {vj }, and measurement errors {uj }, and 
a trial orbital frequency uj — 2n/P,we fit the function 

fj=Acosujtj+BsinLotj+C (I) 

to the data by minimising x^, which we write as Xcirc ~ (^j ~ 
/j)^/""! ■ The number of degrees of freedom is u — N ~ 3, since 
there are three parameters (A,B,C) in the model. Here, we have 
extended the original Lomb-Scargle periodogram by allowing the 
mean to float at each frequency (Walker et al. 1995; Nelson & An- 
gel 1998; CMB99), rather than subtracting the mean of the data 
prior to the fit (the importance of this is discussed by CMB99 and 
Black & Scargle 1982). 

The goodness of fit is measured by the LS periodogram power 
z, defined as 



z{uj) 



AxV2 

A-circ/ 



(2) 



where Ax^ ~ X 



2 

mean 



2 H ^ 

Xcirc ' ana x mi 



is the x^ of a fit of a constant to the data. The periodogram power 
z{lo) measures how much the x^ of the fit improves when a si- 
nusoid of frequency ui is included. As emphasised by Walker et 
al. (1995), this is similar to a classical F-test for comparing fits 
of different models to data (e.g. Bevington & Robinson 1992). To 
search for a periodicity, we evaluate z(lo) for a range of frequen- 
cies, and look for the maximum value zq. 

The significance of the best fit depends on the "false alarm 
probability", or how often a periodogram power as large as the ob- 
served power would arise purely due to noise alone. For a single 
frequency search, the probability that the power at a given fre- 
quency exceeds the value zq is well-determined analytically for 
Gaussian noise; it is 



Prob(z > zo) — 



-I//2 



(3) 



which is the cumulative probability arising from the F2,n-3 distri- 
bution (Schwarzenberg-Czerny 1998; see also CMB99, Appendix 
B for a summary). This distribution takes the simple form 



Prob(2 > Zq) — exp ( 



-20) 



(4) 



for large A''. 

Scargle (1982) defined the periodogram as z{uj) = Ax^/2, in 
which case equation J4j is valid for all A'^. This definition is appro- 
priate when we know the noise level a in advance, and is attractive 
because of the very simple probability distribution of equation J4}- 
However, in general the noise level is not known in advance as we 
discussed in §2.1, and must be estimated from the data (e.g. Home 
& Baliunas 1986). In this case, the periodogram must be "normal- 
ized" by an estimate of the noise obtained from the data. In equation 
j2j> the normalization factor is the x^ of the best-fitting sinusoid. 
The "normalized" periodogram acts to partition the data into two 
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pieces, signal plus noise, and determines the best fitting amplitude 
for each^ . 

For a search of many frequencies, each "independent fre- 
quency" must be counted as an individual trial. The false alarm 
probability is then 



f = 1 - [1 - Prob(z > zo) 



(5) 



where M is the number of independent frequencies, and zq is the 
observed power For small F, 



F ^ M Prob(z > zo) 



(F« 1). 



(6) 



The detection threshold Zd is the periodogram power correspond- 
ing to some (small) value of F (e.g. F = 0.01 for a 99% detection 
threshold), i.e. the value of z exceeded due to noise alone in only a 
small fraction F of trials. An observed power larger than Zd indi- 
cates that a signal is likely present. 

The remaining task is to determine the number of independent 
frequencies M. Whereas sines and cosines are orthogonal functions 
for evenly-sampled data, leading to a statistically independent set 
of frequencies, this is no longer the case for unevenly-sampled data. 
The seperation between peaks in the periodogram is l/T, giving a 
simple estimate of the number of independent frequencies M ~ 
TAf, where A/ = /2 — /i is the frequency range searched. Often 
/2 > /i, in which case M ^ f-zT. 

A better determination of AI is to use Monte Carlo simula- 
tions, in which data sets of noise only are generated, with veloci- 
ties either drawn from a Gaussian distribution, or selected with re- 
placement from the residuals about the mean (the so-called "boot- 
strapping" method, e.g. Press et al. 1992). The fraction of trials 
for which the maximum periodogram power exceeds the observed 
value gives the false alarm probability^ . 

It is worth emphasising the effect of the uneven sampling on 
the number of independent frequencies. For evenly-spaced data, 
the number of independent frequencies is N/2, ranging from l/T 
to the Nyquist frequency, /jvy = N/2T. For unevenly-sampled 
data. Home & Baliunas (1986) found that M ~ N for a search 
up to the Nyquist frequency (see also Press et al. 1992). This 
agrees with our simple estimate above since M ^ f2T ~ N/2. 
However, uneven sampling allows frequencies much higher than 
the Nyquist frequency to be searched (see discussion in Scargle 
1982 and Bretthorst 2001a). In general, M > iV, by a factor of 
/2//JV1/. For example, a set of 30 observations over 7 years has 
fffy « 1/(6 months). A search for periods as short as 2 days then 
has M « 85N « 2500. 



2.3 Generalization of the Periodogram to Keplerian Orbits 

The definition of the LS periodogram in equation J2j suggests an 
immediate generalization to eccentric orbits (which have 2 extra 
parameters, the longitude of pericenter Q, and eccentricity e). 



^ In fact, there was a debate in literature over the appropriate way to "nor- 
malize" the periodogram, whether by the variance of the data (Home & 
Baliunas 1986; Walker et al. 1995), or by the variance of the residuals to 
the best fit sinusoid (Gilliland & Baliunas 1987). Schwarzenberg-Czemy 
(1998) showed that in fact these normalizations are statistically equivalent, 
and we will return to this issue in §2.4. 

^ For Gaussian noise, the number of trials necessary to calculate F can be 
reduced by using the analytic form of the distribution given by equation |5). 
First, using AI as a free parameter, tit the analytic distribution to the distri- 
bution of noise powers from a small number of trials A'^tjiais- Then equation 
Isl with the fitted value of AI allows extrapolation to F 2> (1 /A'^trials)- 
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Figure 1. Distribution of periodogram powers for Gaussian noise. We take 
A'^ = 16 and search at a single frequency (top panel) and for periods in the 
range 1 day to 10 years (bottom panel). Symbols are the results of numeri- 
cal simulations with 10000 trials (top panel) or 2000 trials (bottom panel): 
circles are for the LS periodogram; open triangles are for the Keplerian pe- 
riodgram, fixing the period during the fit; solid triangles are the Keplerian 
periodogram with the period allowed to vary during the fit. Dotted and solid 
curves show the analytic distributions from equations J3} and JS}. For the 
lower panel, the number of independent frequencies needed to fit the nu- 
merical results is AI = 6000 for the LS periodogram, and AI = 1500 for 
the Keplerian periodogram. The duration of the data set is fa 1750 days, 
giving AI ft! 3TA/ and ft; TA/ respectively. 



,{uj) = 



(7) 



where Xkcp is the X of a fit of a Keplerian orbit to the data (with 
u = N — 5 degrees of freedom), Xmean the of a fit of a constant 
to the data, and Ax^ = Xmcan ~ XKcp- Again, the normalization 
factor in the denominator of equation can be dropped if the noise 
is known in advance. 
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Figure 2. The false alarm probability associated with a Keplerian fit to A'^ 
observations. Each curve shows the FAP from equation |8j, labelled by the 
value of Xmcan/XKop' curves are for A/ = 3000, dotted curves are 
for M = 1000. 



The complication in implementing equation is that Kep- 
lerian fits are nonlinear, involving relatively slow searches over a 
complex space. Therefore, we use the period and amplitude 
of the best fitting sinusoid obtained from the LS periodogram as 
an initial guess for the Keplerian fit. We then use a Levenberg- 
Marquardt scheme (e.g. Press et al. 1992) to find the minimum, 
trying several initial starting values for the phases and eccentricity. 
The minimum is then used to calculate from equation Q. 



2.4 The False Alarm Probability of a Keplerian Fit 

Given the redefinition of the periodogram power for Keplerian or- 
bits, the search for significant fits proceeds in a similar way as for 
circular orbits. For a given data set, we perform a wide frequency 
search with the LS periodogram, and then use the best fitting si- 
nusoids as starting points for Keplerian fits, calculating the peri- 
odogram power from equation 0. The significance of the resulting 
best fitting orbit is determined by a Monte Carlo method in which 
we make fake data sets containing noise only, and ask how often Ze 
exceeds the observed value. A similar calculation allows the detec- 
tion threshold corresponding to a given false alarm probability to 
be determined. 

As in the circular orbit case, there is an analytic estimate for 
the distribution of noise powers at a single frequency. The distri- 
bution of powers for Gaussian noise at a single frequency is given 
by the _F4,jv_5 distribution (Hoel, Port, & Stone 1971). Integrating 



this, we find 



Prob(z > zo) = 1 



u + 2 4zo 



'(i/+2)/2 



(8) 



^ Eq. Ill becomes Prob{2 > zq) = (1 + 2zo) exp (-2zo) for > ^o- 
Since this is also the cumulative distribution of this limit applies to 

the case where the noise is known in advance, with Zs defined without the 
normalizing XKcp/'^ factor 



where v — N — 5. 

The analytic distribution is compared to numerical calcula- 
tions in Figure Q The symbols are the results of numerical cal- 
culations for data sets with A*' = 16. The top panel is for a search 
at a single frequency, and the bottom panel is for a wide frequency 
search between periods of 1 day and 10 years. The results for the 
LS periodogram (circles), and the corresponding analytic distribu- 
tion (dotted curves) agree extremely well. For the Keplerian peri- 
odogram, we show results with the period held fixed during the fit, 
and with the period allowed to vary during the fit. The solid curves 
are the analytic distribution of equation Jsj. The agreement is good 
for the case where period is held fixed. When the period is allowed 
to vary during the fit, the powers are systematically larger, since the 
search algorithm is able to step off the period grid and find a better 
fit. For this example, the best-fit number of independent frequen- 
cies is M ~ 6000 for the LS periodogram, and M ~ 1500 for 
the Keplerian periodogram. This is roughly 3 times or equal to the 
simple estimate M — TA f respectively. 

The analytic distribution allows a simple method for estimat- 
ing the significance of a Keplerian fit to radial velocity data. First, 
given Xkcp from the Keplerian fit, and Xmcan from the fit of a con- 
stant to the data, calculate the power zo using equation 0. An es- 
timate of the false alarm probability is then F ~ A/Prob(z > zo) 
(eqs. [5] and [6]), where Prob(z > zo) is the probability distribu- 
tion given by equation Js}. The number of independent frequencies 
is roughly M ~ TAf, where T is the duration of the observations, 
and A/ the orbital frequency range searched during the fit. 

Figure|2|shows the FAP as a function of A'^ for different values 
of Xmcan /XKcp- Solid curvcs are for M = 3000, dotted curves are 
for M — 1000. This Figure can be used for a quick assessment 
of the FAP associated with a particular Keplerian fit. For example, 
consider a set of radial velocity measurements with A*' = 30 and 
116 (or a reduced x^ of 4), and with the best fitting 



Xmcan 



Keplerian having Xkc 



Figure|2|shows that in this case, with A'^ : 
4, the false alarm probability is ~ 5 x 10" 



30 (reduced x of 1.2). Inspection of 



30 and Xmcan /XKep 



3 A BAYESIAN APPROACH TO THE PERIODOGRAM 

In this section, we show how to understand the Lomb-Scargle pe- 
riodogram in terms of basic probabilities, i.e. a Bayesian approach. 
This leads naturally to the Lomb-Scargle statistic for sinusoid fits, 
and gives a different way to think about the number of independent 
frequencies. A comprehensive and detailed discussion of these is- 
sues can be found in a series of papers by Bretthorst (Bretthorst 
1988, 2001a,b,c) on the application of Bayesian techniques to spec- 
tral analysis, as well as Scargle (2002) and Loredo & Chemoff 
(2003). In addition. Ford (2003) discusses the advantages of using 
likelihood functions to calculate uncertainties in orbital parameters. 
Those readers interested only in the practical results on detection 
thresholds should skip ahead to fj4] 



3.1 The Likelihood Function and Detection Threshold 

We start with the likelihood functions with and without a signal 
present (Sivia 1996). For a sinusoid fit to the data with parameters 
a = {K, (f), P, c), the probability of the data given the sinusoid 
model is 



Pid\K,P,(b,c) 



exp 



Xcirc 



iK,P,^ 



(9) 
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which comes from drawing each observed velocity from a Gaussian 
distribution. Here, Xcirc is not the minimum value of , but is the 
value of for a particular choice of the parameters [K, P, (j), c). 
Since we also have Keplerian fits in mind, we generalize to m pa- 
rameters, and marginalize or integrate over them to obtain the prob- 
ability of the data given the presence of a signal, 



P{d\l) 



dm -t 
a- 



1 



■ exp 



Xcir 



Similarly, without a signal present, we may write 
1 



(10) 



P(d|0) = I dc 



■ exp 



Xmcan 



where we integrate over the constant term c, the only parameter in 
this case. In these expressions, the notation "1" or "0" indicates the 
presence or absence of a planet, and "d" represents a particular set 
of radial velocity measurements. 

We now follow the detection theory approach of Wainstein & 
Zubakov (1962). We write the total probability of the data as 



P{d) = P{l)P{d\l) + P{0)P{d\0), 



(12) 



where P(l) and P(0) are the prior probabilities that there is or is 
not a signal present, and we assume here that the only possibilities 
are that zero or one planet is present in the data. In this frame- 
work, the false alarm probability is the probability that there is no 
signal present given the data, P{0\d). Combining Bayes' theorem, 
P(Ojd) = P(0)P{d\0)/P{d), with equation (T2j, this is 



F = P{0\d) = 



1 



1 + A' 

where we have defined an odds ratio 
P(l|d) P(l)P(d|l) 



A 



P(0|d) P(0)P(d|0)' 



(13) 



(14) 



For A ^ 1, the false alarm probability is the inverse of the odds 
ratio, F — P(0|d) « A~^. Therefore, the detection threshold cor- 
responds to a critical value of A. As in the usual frequentist appli- 
cation of the periodogram, the choice of detection threshold rep- 
resents a compromise between the desired number of false alarms 
versus false dismissals of real signals (see Appendix of Wainstein 
& Zubakov 1962 for a detailed discussion of this issue). 



3.2 Evaluation of the Odds Ratio and Relation to the LS 
Periodogram 

We now calculate A and show that it has a direct relationship to 
the periodogram. The integrals in equations <10> and <1 1> may be 
evaluated by expanding the integrand around its maximum in terms 
of the parameters a. We consider the two cases in which the noise 
a is known or unknown. In the first case, expanding near its 
minimum gives ~ Xmin + 5a ■ a ■ 5a, where the curvature 
matrix 



1 d'^x' 



2 daidaj 



(15) 



is the inverse of the correlation matrix of the x^ f^U C — 
(e.g. Press et al. 1992). The integral over the parameters is then a 
standard multidimensional Gaussian integral, giving 



P{d\l) = 



1 



/2^) (jJV Vdeta 



exp 



Xci 



rc,min 



(16) 



where v = N — m. Evaluating P(d|0) in a similar way, and taking 
the ratio, gives 



>/2 



detCi 



det Co 



1/2 



P(l) 

P(0) 



exp 



2 



(17) 



where Co (Ci) is the covariance matrix of the fit without (with) 
a signal present. Am is the number of parameters describing the 

signal, and Xmean,min Xcirc,min- 

When a is unknown, we integrate or marginalize equation ilQ\ 
over a, taking limits of integration to be to oo. This gives 



(11) P(d|i)(x / d'"d (xLc) 



-(N/2) 



(18) 



with a similar result for equation <1 It . The integrals over parameter 
space are performed as previously, by making a Gaussian approxi- 
mation near the peak of the integrand. This gives 



A = (27r) 



Am/2 



detCi 



det Co 



P(l) (xi 
P(0) 



Cmcan 
A.circ 



I//2 



(19) 



M = (27r 



givmg 

F^K- 



To see the relation of this result to the periodogram, we write 
det Co 1^''^ P(0) 



-Am/2 



detCi 



P(l)' 



M Prob(2: > zo), 



(20) 



(21) 



identical to equation (6), with Prob(2 > zo) given by either equa- 
tion js} or J4} depending on whether a is known or unknown'' . 

For sinusoid fits, we have recovered the LS periodogram, but 
with a new interpretation of M. For independent parameters, the 
covariance matrix is diagonal, so that det C ~ . 5ai, where 5ai 
is the uncertainty in parameter i (e.g. Press et al. 1992). For circular 
orbits, this gives an estimate 



M 



AP Ait" P(0) 



(22) 



5P 5Kd(t> P(l) 

The first term is the estimate for M that we gave in §2.2, the range 
of frequencies searched divided by the frequency resolution. How- 
ever, we see that in a Bayesian approach, M also includes the range 
of amplitudes and phase considered, and the prior probability of 
a signal being present. This corresponds to a different picture in 
which rather than making many trials searching for the lowest x^, 
we have instead integrated over all possible values of the param- 
eters, weighting each choice by its relative probability. For fits of 
Keplerian orbits, the probability distribution of Ze is given by equa- 
tion Js}, which is similar to but slightly different from equations 
<17t and <19t . The significance of this difference is not clear 

The most useful application of this method is as a powerful 
tool for population analyses. Here, we have discussed the choice 
between detection and non-detection so that the connection to 
the periodogram could be seen. However, the advantage of the 

■* There is a small difference, which is that v = N — i here for a circular 
orbit rather than N — 3. The loss of one degree of freedom comes about be- 
cause of the integration over period. We also mention here that this deriva- 
tion of the probability distributions applies to either choice of periodogram 
normalization. Using an analysis of variance approach, Schwarzenberg- 
Czemy (1998) showed that normalizing the periodogram by the variance 
of the data or by the variance of the residuals was statistically equivalent, 
but led to a different form for Prob(2 > zq) in each case (see Table 9 in 
Gumming et al. 1999). In fact, both these distiibutions ai'e a direct rewrite 
of the last term in equation 1191 . using the respective definition of z. 
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Figure 3. Signal to noise ratio K/cr that can be detected with N observa- 
tions, and 99% (upper curve) and 50% detection efficiency (lower curve). 
We assume short period (P < T) circular orbits, and use the LS peri- 
odogram. The solid lines show the analytic result for M/F = lO". The 
points show numerical calculations for data sets with the indicated values 
of A'^, and a realistic spacing of observation times. The dotted lines show the 
result when the noise level is known in advance. The dashed curve shows 
the 50% detection efficiency curve for a test for excess variabiHty. 

Bayesian approach is that no decision regarding detection needs 
to be made. Instead, both possibilities can be included, with the 
probability of each calculated as P{l\d) and P{0\d). For exam- 
ple, integrating equation j9) over the "nuisance" parameters and 
c gives Prob(P, K\d) for each star. This can be used to study the 
underlying mass and orbital period distribution. Further integration 
of Prob(P, K\d) gives Prob(Pjci), equivalent to the periodogram, 
or Pioh{K\d) from which limits on K can be derived. Extension of 
this approach to eccentric orbits, or multiple planets, is straightfor- 
ward in principle, but in practice involves more complex integrals 
(see Ford 2003 for a suggestion of how to evaluate them). 



4 VELOCITY THRESHOLDS 

In this section, we use the LS periodogram and the Keplerian peri- 
odogram discussed in §2 to derive analytic expressions for the ve- 
locity thresholds. We discuss orbital periods P shorter and longer 
than the duration of the observations T separately. 

4.1 Short Periods (P < T) 

We first consider circular orbits, and use the LS periodogram. Fig- 
ure |3| shows the signal to noise ratio K/a required for 50% and 
99% detection probability as a function of the number of observa- 
tions. The squares show numerical calculations for sets of A*' obser- 
vations with realistic spacing between observation times. For each 
set of observation times, we use a Monte Carlo method to determine 
the detection threshold, generating sequences of velocities drawn 
from a Gaussian distribution, and finding the power exceeded in 
0.1% of trials (the 99.9% detection threshold). We search for peri- 
ods between 2 days and 10 years. We then make fake data sets with 



Q 
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Figure 4. Detection efficiency (DE) as a function of eccentiicity for a time 
series with N = 16 (top panel) and = 39 (bottom panel). We use a 
period P = 100 days, and show signal to noise ratios K/a = 2, 5, 10, 
and oo (top panel) and K/cr = 2, 10, and oo (bottom panel). Dotted lines 
show the results using the LS Periodogram (fitting sinusoids); solid lines 
use Keplerian fits. 

increasing signal amplitude K until the signal is detected in either 
50% or 99% of trials. 

To obtain an analytic estimate, we first write down the detec- 
tion threshold Zd corresponding to a critical false alarm probability 
F (where F <ti 1). Equations ^3} and ^6} give 




for a given number of independent frequencies M. When a signal 
of amplitude K is present, the periodogram power Zs has a distri- 
bution of values because of noise fluctuations. For large signal am- 
plitude or A^, the distribution of periodogram powers when a signal 
is present is Gaussian with mean (zs), and variance 2{zs) (Groth 
1975), where (zs) = {v/2){K^/2a^) (Scargle 1982; Home & 
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Figure 5. Examples of velocity curves with e = 0.5 that are (top panel) 
and are not (bottom panel) detected. The dotted line in each case shows the 
true orbit; the points are the observed velocities; and the solid curve shows 
the best fitting orbit. In both cases, the solid curve gives a lower than 
the dotted curve. The lower panel has only a single measurement during the 
periastron passage, and is not a significant detection. 
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Figure 6. The effect of eccentricity on the velocity threshold for TV = 39. 
The dotted curves are for the LS periodogram; the solid curves are for Kep- 
lerian fits. The rapid increase in the 99% detection efficiency (solid) curves 
at e Si 0.8 is due to the fact that very eccentric orbits are not always de- 
tected even when the signal to noise is very large. 



Baliunas 1986). In this limit, the probability of detecting a signal 
with mean amplitude (zs) for a given detection threshold Zd is 



Pdctcct {{z s) Zd) ~ — 



1 + erf 



(Zs) - Zd 



(24) 



Setting Zd = (za) gives the signal to noise ratio needed to detect 
the signal 50% of the time, 

or, for large A'', 



4 In 



1/2 



{N » 1), 



(26) 



which shows the expected 1 / ^/N behavior^ . The solid lines in Fig- 
ure|3|show this analytic estimate for M/F = lO". We include a 
multiplicative factor of 1.7 for the 99% detection probability curve 
(this factor is given by equation <24> with Pdctcct ~ 99%). 

Figure|3|shows that A'^ > 10-20 is required to be able to de- 
tect an orbit with K ~ 2-4cr, with A'^ > 50 required to reach 
amplitudes as small as if ~ a. How does that compare to the case 
where we know the noise level a? The appropriate formula is then 
equation <26> for all A*'. The dotted curves in Figure |3| show the 
50% and 99% detection probability curves in this case. Knowing 
the noise level in advance gives a significant improvement, allow- 
ing a detection of the signal for A'^ < 10. The sparse sampling 
of the data may increase the detection threshold somewhat over 
this estimate (Nelson & Angel 1998). It is also interesting here to 
compare the detection threshold of the LS periodogram with a 
test for variability. The dashed line in Figure|3|shows the signal to 
noise ratio that gives a exceeding the expected value 50% of 
the time, assuming that the noise level cr is a known quantity. Here 
we choose the detection threshold to correspond to a FAP of 0. 1%. 
Excess variability at the ~ 2a level is apparent with only a handful 
of observations. 

We next discuss the effect of non-zero eccentricity. Figure |4] 
shows the detection probability as a function of eccentricity for sig- 
nal to noise ratios of K/a = 2, 5, 10 and ex, where oo means that 
we sample the velocity curve without adding noise. The top panel 
shows results for a data set with 16 observations, the lower panel 
for 39 observations. The dotted curves show the detection probabil- 
ity using the LS periodogram. The solid curves show the detection 
probability using the Keplerian periodogram defined in equation 
0. Not surprisingly, the LS periodogram fails to detect orbits with 
large eccentricities, particularly for small A'^, whereas fitting Kep- 
lerian orbits increases the detection probability for eccentric orbits. 

Figure |4| shows that even when fitting Keplerian orbits, de- 
tectability decreases for e > 0.5. There are two reasons for this. 
First, even for K/a 3> 1, we find detection efficiencies < 1 (Ri 80- 
90%) for e = 0.9. In the cases that are not detected, the Keplerian 
fitting routine fails to find the correct solution. This emphasises 
the difficulty of finding the global minimum in the complicated 
space for these nonlinear solutions. The second effect, dominating 
at lower K/a, is the uneven sampling of the data, which can lead to 
a poorly-resolved periastron passage. An example is shown in Fig- 
ure |5| in which we show two sets of observations of an orbit with 
e — 0.5. Both fitted lightcurves (solid curves) have a lower than 
the true solution (dotted curve), but whereas the data in the upper 
panel lead to a detection, the data in the lower panel do not. In the 
lower panel, only a single measurement has been made during the 
periastron passage. This greatly reduces the Ax^ when the Keple- 
rian orbit is included in the fit. An additional danger is that a single 
discrepant data point might arise due to a systematic error, perhaps 
making the fit in the lower panel of Figure|5]worrying in a real life 
example. 

Figure |6| summarizes the effect of eccentricity. We plot the 
signal to noise ratio needed for a detection efficiency (DE) of 50% 
or 99% as a function of eccentricity. The solid curves are for the 
Keplerian periodogram, and the dotted curves are for the LS peri- 
odogram. The rapid increase in K/a for the 99% DE solid curve 



5 To derive this limit, write (M/F)"^!" = 1 + x, where x < 1. 
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Figure 7. Velocity thi'eshold for a star with A'^ = 39, and detection ef- 
ficiency 50% (lower curve) and 99% (upper curve). The vertical dashed 
line shows the duration of the observations (si 2600 days). The detection 
threshold is set at 99% significance for a search down to 1 day period. The 
dotted lines show the analytic result for M = 5000 and F = 0.01. The 
50% curve scales as K oc P for P > T; the 99% curve scales as K oc P^ 
for P < (7r/8Fd)r fa 40T, and K (x P thereafter. 



at e ~ 0.7 is due to the failure to detect even high signal to noise 
orbits for e ~ 1. For e < 0.6, the effect of eccentricity on the 
amphtude needed for detection is small. 

We have not yet discussed the dependence of the velocity 
threshold on orbital period. Figure Q shows the amplitude needed 
for 50% and 99% DE for iV = 39 as a function of period. We dis- 
cuss the long period behavior (P > T) in the next section. The dot- 
ted lines are the analytic result for M/F — 10^, and compare well 
with the numerical calculations. This Figure shows that the ampli- 
tude threshold is not very sensitive to period for P < T. There is 
some loss of sensitivity at periods related to a month and a year, and 
particularly at P = 1 day, introduced by the time sampling of the 
data, but this is a small effect even for A'^ ~ 10. This insensitivity 
can also be seen in the curves of Walker et al. (1995), Nelson & An- 
gel (1998), and CMB99. It is a result of the uneven sampling which 
gives good phase coverage for many frequencies (Press et al. 1992). 
Therefore the velocity threshold A'o characterizes the short period 
detectability for most periods < T. 



4.2 Long Periods (P > T) 

At long periods, the detection sensitivity drops because the obser- 
vations cover only part of an orbit. For circular orbits, Eisner & 
Kulkami (2001) pointed out that there is a correlation between the 
fitted phase and amplitude at long periods, and derived analytic 
expressions for the velocity threshold. Here, we adopt a slightly 
different approach. It is useful to think of "sine-like" and "cosine- 
like" phases, depending on whether the orbit is close to a velocity 
maximum/minimum or to a zero crossing. These two cases are il- 
lustrated by Figure [S] For T <C P, the velocity variation due to 
the signal is Av = Ksm{2-KT/P) ^ K{2tvT/P) for a sine-like 
variation, or Av = Kcos{2tvT/P) ^ {K /2){2tiT / Pf for a 



Figure 8. Example of "cosine-like" and "sine-like" velocity variations for 
observations covering 1/10 of an orbital period . For the cosine case (up- 
per panel), the variation in radial velocity is Af fa {K /2){2itT / P)"^ ; for 
the sine case (lower panel), the variation is A.v fa K{2ttT/P). If Ko is 
the velocity amplitude that can be detected at short periods, the detection 
threshold in each case is given by Aii = 2Ko (see text). 



cosine-like variation. Setting Aw = 2Ko then gives an estimate for 
the velocity amplitude needed for detection. 

Averaging over phase introduces slightly different numerical 
factors. We find that a good approximation to the velocity threshold 
is 

^- sin(.r/2P) -^°(g) 



(27) 



for e_D < 3/4, or 



T < P < 



8(l-eD) 



P> 



(28) 



8(1-61,) 



ir _ ) l-cos(7rT/2P) ~ XtvI 

for en > 3/4, where e_D is the detection efficiency. The scaling 
K oc P corresponds to Mp oc a? ; the scaling K (x P^ corresponds 

to Mp oc a^''^. 

The reason that the scalings depend on the detection effi- 
ciency is that a sine-like phase gives a larger A?; than a cosine-like 
phase. So for a 50% detection threshold (e_D ~ 0.5) the ampli- 
tude must be large enough that sine-like phases are detected, but 
cosine-like phases do not have to be. For a 99% threshold how- 
ever (e_D = 0.99), almost all phases must be detected, requiring a 
large amplitude. Eventually, at very long periods, almost all phases 
are sine-like, and the scaling changes to oc P once more. Figure 
^compares the analytic and numerical results at long periods. For 
this particular example, the change in scaling of the 99% threshold 
from oc P^ to oc P can be seen at P ~ 10"' days. 

In their investigation of detectabilities. Nelson & Angel (1998) 
adopted the square of the best fitting amplitude as their test 
statistic. Eisner & Kulkarni (2001) pointed out that this leads to re- 
duced sensitivity at long periods, because when fitting sinusoids to 
noise only, one finds that the amplitude is strongly correlated with 
the fitted phase. The cosine-like phases have much larger fitted am- 
plitudes than the sine-like phases, and so the velocity threshold has 
a K oc P^ scaling. Indeed, our results nicely explain the empir- 
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Figure 9. Ratio of the 50%, 90%, and 99% velocity-squared thresholds to 
the cosine like scaling K'^ = 4:Kq/{1 — cos{ttT/2P))^ . Comparison of 
this plot with Figures 8, 9, and 10 of Eisner & Kulkami (2001) shows that 
the LS periodogram and amplitude-phase analysis have equal sensitivities 
for long periods. The strong deviation for the 50% curve is because this 
curve follows a sine-like scaling (eq. 1271 1 rather than cosine- like (eq. l28l ). 





1 1 1 1 1 1 1 

P>T 










DE = 99% "X^ 


















DE = 50% 


" --. TXv^^ 

\ 

\ \ 








^ \ 
^ \ 

1 ,\ , 1 ,\ 





0.2 0.4 0.6 0.8 

Eccentricity 



Figure 10. For the same data set as Fig.0 and an orbital period of 50 years, 
the effect of eccentricity on the signal to noise needed for detection 50% 
(lower curves) or 99% (upper curves) of the time. The solid curves are for 
the Keplerian periodogram; the dotted curves are for the LS periodogram. 
The dashed curves show an analytic (1 — e)^ scahng. 

ical scaling found by Nelson & Angel (1998), K oc {P/f3T)°', 
with a — 1.86 and (3 ~ 1.45. Our formula gives a = 2 
and /3 = tv/2 = 1.57. Eisner & Kulkami (2001) adopted an 
"amplitude-phase" analysis to account for this correlation, in which 
the detection threshold is set by an ellipse in the K-(j> plane. They 
showed that the sensitivity was significantly improved. However, 



Figure 11. For data sets with = 16, the effect of eccentricity on the 
50% detection curves. We take e = 0, 0.3, 0.5, 0.7 (top to bottom at short 
periods). The dotted curves show the analytic estimates. The vertical dashed 
curve indicates the duration of the observations. 

their proposed method is unnecessarily complicated. The correla- 
tion between amplitude and phase is automatically included in the 
LS periodogram, which is much simpler to use since it is defined in 
terms of the single variable Ax^ • We have implemented the method 
of Eisner & Kulkarni (2001) and find that it matches the sensitivity 
of the periodogram at all periods. Figure |9| shows the ratio of the 
velocity thresholds from the periodogram to the cosine-like scaling 
of equation <28t . Comparison with Figures 8, 9, and 10 of Eisner & 
Kulkami (2001) shows that the LS periodogram has equal sensitiv- 
ity compared to the amplitude-phase technique. 

Walker et al. (1995) looked for long term periodicities by fit- 
ting a quadratic to the data v — a + bt + ct^ , and checking for a 
significant reduction in using an F-test. CMB99 adopted a sim- 
ilar approach but with linear fits to the data. How is this related to 
the long period sensitivity of the periodogram? In fact, for periods 
P > 2nT, the periodogram is no longer sensitive to the period. A 
quadratic fit gives a relation between the amplitude and period 

^-2^(^'+^Vj ' ^29) 

so that the best fitting velocity amplitude is determined for all pe- 
riods (> T) by the fit. We have checked the sensitivity to long 
period circular orbits of an F-test based on quadratic fits to the 
data, and find that quadratic fits reproduce the sensitivity of the 
periodogram for long periods. In the regime where the detectable 
amplitude scales oc P, a linear fit is adequate to detect the signal*". 
However, the false alarm probability used for the F-test must reflect 

^ It is well known, for example in pulsar timing (Joshi & Rasio 1997) that 
if an orbit can be measured precisely enough, the parameters of the orbit 
can be determined on a timescale much less than the orbital period by very 
accurate measurements of orbital derivatives. For circular orbit s, a cu bic fit 
V = a-\-bt+ct'^+dt^ gives a period measurement P = 2-Ky^b/6d which 
can be inserted into equation i29\ to find the amplitude K. This imphes that 
if we could make an accurate enough measurement of the radial velocity 
curve, the complete orbital solution could be determined. 
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the number of additional trials that are carried out at high frequency 
using the periodogram. For example, a search for long period or- 
bits only might adopt a FAP for the F-test of 1%. However, if short 
period orbits are also searched, and the number of independent fre- 
quencies for P < T is ~ 1000, then we should choose a FAP of 
- 10"^ 

Detection efficiency plots for long period eccentric orbits are 
shown in Figures [T0|and[TT] Fi gure Island the related discussion 
makes it straightforward to understand the detectability in this case. 
At very long periods, the number of parameters needed to describe 
the data is 1 or 2 as discussed above, and therefore the scalings with 
period are the same as the LS periodogram. However, the transition 
into the long period regime happens at longer orbital periods for ec- 
centric orbits, because of the distorted nature of the lightcurve. The 
width of the periastron passage is ~ (1 — ef'P, so that equations 
i27l and l28j still apply, but with the substitution r->T/(l-e)^. 
The dotted curves in Fieure fTTI show the analytic formula with this 
substitution. Therefore at long periods, eccentric orbits are more 
easily detected for a given K. 



5 SUMMARY AND DISCUSSION 

The main results of this paper are (i) a method for estimating the 
significance of a Keplerian fit to radial velocity data, and (ii) an 
analytic expression for the velocity threshold of single planets in 
terms of the number and duration of the observations, number of 
independent frequencies, and the required false alarm probability. 
For circular orbits, equation 125 > (or eq. 1261 for A'^ 1) gives the 
velocity amplitude threshold at short periods (P < T). Equations 
j27landj28l give the amplitude threshold at long periods (P > T) 
as a function of the required detection efficiency. At long periods, 
the 99% detection threshold scales as oc P^ oc a^, whereas the 
50% detection threshold scales as if oc P oc a^^"^. 

We presented a straightforward generalization of the Lomb- 
Scargle periodogram to Keplerian orbits, based on the improvement 
of x'^ when a Keplerian orbit is included in the fit, and discussed 
a Monte Carlo method to calculate the false alarm probability as- 
sociated with the fit. An simple analytic estimate of the false alarm 
probability is as follows. Given XKep from the Keplerian fit, and 
Xmoan from the fit of a constant to the data, first calculate the power 
zo using equation 0. An estimate of the false alarm probability is 
then FAP ^ AfProb(2; > zq), where M « TA/ is the number of 
independent frequencies (A/ is the frequency range searched), and 
Prob(2 > zq) is the probability distribution given by equation {S}. 
Alternatively, Figure|2|may be used to find the FAP for a given TV 
and Xmoan/XKop fatlo. This Figure should prove useful for a quick 
estimate of the FAP associated with a Keplerian fit. 

We used the Keplerian periodogram to investigate the effect 
of eccentricity on detectability. Eccentricity acts to make detection 
more difficult at short periods, where the uneven sampling often re- 
sults in inadequate phase coverage during the rapid periastron pas- 
sage of an eccentric orbit. At long periods, the increased velocity 
amplitude and acceleration near periastron increase detectability. 
The transition to the long period regime occurs for orbital periods 

We have also discussed the statistics of the Lomb-Scargle pe- 
riodogram, including a derivation of the periodogram from basic 
probability theory. This Bayesian approach gives a simple deriva- 
tion of the statistical distribution of periodogram powers for Gaus- 
sian noise, and clarifies the nature of different periodogram nor- 
malizations. The best statistic to use at both short and long orbital 
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Figure 12. Known companions (circles) and detection thresholds. 99% de- 
tection thi'esholds are dotted lines; 50% detection thi'esholds are the soUd 
lines. In each case, we make 5 observations per year for 3, 6, and 12 
years (N = 15, 30, and 60), and take cr = 5 m/s and M* = 1 Mq. 
From top to bottom, the short period amplitude thresholds are Kg = 
28, 20, 13, 9.5, 8.2, and 5.8 m/s. We assume M = (T/ days). The solid 
square shows the location of Jupiter. 
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Figure 13. Velocity amplitude and eccentricity of known companions. The 
curves show 50% (solid) and 99% (dotted) detection thresholds, for (top to 
bottom) N = 16, and A'^ = 39, assuming cr = 5 m/s. 

periods is Ax^, the improvement in x^ when the planet is included 
in the velocity fit. Using the square of the fitted amplitude (Nel- 
son & Angel 1998) results in decreased sensitivity at long periods. 
A seperate analysis of K and phase recovers this sensitivity at 
long periods (Eisner & Kulkarni 2001), but is unnecessary if Ax'^ 
is adopted as the statistic. 

Both Nelson & Angel (1998) and Eisner & Kulkarni (2001) 
argue for the superiority of a "least-squares" rather than "peri- 
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odogram" approach to this problem. Partly, this is based on a pref- 
erence for dealing directly with the parameters of the fit (ampli- 
tude K and phase (j) for circular orbits) and the resulting , rather 
than a "black box" periodogram. For example, the original LS pe- 
riodogram must be "modified" to include a constant term as a free 
parameter at each frequency (Walker et al. 1995; Nelson & Angel 
1998; CMB99), whereas this arises naturally when thinking about 
a fit of a model to the data. In addition, since the form of the 
signal is exactly known, i.e. a Keplerian orbit, the argument is that 
Fourier or spectral analysis of the data is not the most efficient way 
to look for the signal. We hope in this paper to have clarified the 
equivalence of the least squares and periodogram approaches when 
Ax^ is used as the test statistic. 

We have considered only single planets in this paper, whereas 
multiple systems of planets are common. In cases where planets are 
well-seperated in period or in amplitude, our results may be appli- 
cable. For example, a linear trend in the velocities is often included 
in orbit fits to subtract any long term velocity variations due to a 
long period companion. Our results apply to this case if the num- 
ber of degrees of freedom is reduced by I, so that TV — 1 becomes 
iV - 2, iV - 3 becomes iV - 4 etc. (see Walker et al. 1995; CMB99 
for discussion of linear and quadratic "background models"). Of 
course, this correction is only relevant for small A'^, for large A*", 
the results carry over directly. Another case in which two planets 
are close in period, but well-seperated in amplitude is discussed by 
Narayan, Gumming, & Lin (2004, in preparation). 

In many situations when looking for a periodic signal, it is 
possible to detect signals with amplitudes much less than the back- 
ground noise level. In the case of planet searches, this is not the 
case, because of two factors. The first, which we have discussed 
in this paper, is statistical. The small number of observations limit 
the detectable amplitude to ~ 2-4 a for A'' « 20-30 (e.g. see 
Figure 1). Here, o refers to a combination of measurement errors, 
both statistical and systematic, and intrinsic stellar "jitter". When 
the number of observations is less than ^ 10, it is impossible to 
characterize an orbit, as was previously pointed out by Nelson & 
Angel (1998). Detection of signals < 1 cr requires N > 50. 

The second factor which limits detectability is uncertainty sur- 
rounding the stellar jitter. This may arise from convective inho- 
mogeneities, or rotational modulation of magnetic features on the 
surface (Saar & Donahue 1997), all processes with characteristic 
timescales comparable to extrasolar planet orbital periods. There- 
fore, although the magnitude of the stellar jitter can be estimated 
based on stellar properties (Saar et al. 1998), its time variability is 
a significant source of uncertainty. Planet detections with K < a 
require a much better understanding of jitter. Observations of mag- 
netic activity indicators simultaneous with the radial velocity mea- 
surements offer some hope of correcting for these effects (Saar & 
Fischer 2000; Paulson et al. 2002), but this work is in its early 
stages. Improvement in measurement errors to the ~ 1 m/s level 
will help to disentangle systematic errors and stellar jitter effects. 

Understanding the distribution of planet orbital period, mass, 
and eccentricity at low masses, long orbital periods, and large ec- 
centricities requires careful analysis of the radial velocity data for 
each survey (e.g. Walker et al. 1995; Gumming et al. 1999; Gum- 
ming et al. 2003; Santos et al. 2003). Nonetheless, it is interesting 
to compare the detection thresholds we find in this paper with the 
observed planet properties. For a mass Mp, the velocity amplitude 
is 

28.4 m/s /MpsiniX / P ( \ 



where P is the orbital period, Mp is the mass of the planet, and A/* 
is the mass of the star. In Fi gures fT2l and l 1 3 1 we show the mass, peri- 
ods, and eccentricities of known planets compared to the detection 
curves. In Figure fT2l we show curves of 50% and 99% detection 
efficiency for a = 5 m/s and 3, 6, and 12 years of observations 
with 5 observations per year. These curves roughly match the ob- 
served cutoffs at low masses and long periods. Jupiter's position is 
indicated by a black square for comparison, indicating that Jupiter 
analogs will be detectable in the near future. In Figure fTsl we show 
50% and 99% detection efficiency curves for short period orbits, 
and compare them with the observed K-e distribution. The lack of 
observed highly eccentric orbits is possibly due to physical effects 
which limit their survival (e.g. Ford, Havlikova, & Rasio 2001). 
However, our results emphasise that there remain significant se- 
lection ejfects against eccentric orbits for e > 0.6, and the small 
number of highly eccentric planets discovered so far may reflect 
this. In this regard, it is worth noting that the most eccentric orbit 
discovered so far, with e = 0.93 (HD 80606; Naef et al. 2001), also 
has one of the largest amplitudes /iT = 411 m/s(A/sini = 4A/,/). 
Future observations may reveal lower mass planets in highly eccen- 
tric orbits that have hereto gone undiscovered. 
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